Listing of the Claims 

The listing of claims that follows will replace all prior versions in the application. 

1 . (Currently Amended) A method of comparing the semantic content of two or more 
documents, comprising: 

accessing two or more documents; 

performing a linguistic analysis on each document; 

outputting a quantifi e d r e pr e s e ntation of th e o e mantic cont e nt of e ach docum e nt; and 
comparing th e quantifi e d r e pr e s e ntations using a d e fin e d algorithm. 

outputting a semantic vector for each document, said semantic vector having 

multiple components, wherein each component of said semantic vector has at least: 

a word or phrase appearing in the document or a synonym of said word or 

phrase; 

a weighting factor associated with said word or phrase or synonym; and 
a frequency value. 

2. (Previously Presented) The method of claim 1, wherein the linguistic analysis comprises 
sentence analysis. 

3. (Previously Presented) The method of claim 2, wherein the sentence analysis comprises 
a syntactic analysis and a semantic analysis. 

4. canceled. 

5. canceled 
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6. (Currently amended) The method of claim & 1, wherein each component of the semantic 
vector can have multiple dimensions. 



7. (Currently amended) The method of claim 6 1, wherein each component of the semantic 
vector includ e s on e or mor e t e xt valu e o further comprises a subordinate concept 
value. 

8. canceled 

9. canceled 

1 0. (Currently amended) Th e m e thod of claim 8, wh e r e in e ach compon e nt of th e s e mantic 
v e ctor has two valu e s: 

a word or phras e app e aring in the docum e nt or a synonym of said word or phras e ; and 
a w e ighting factor associat e d with said word or phras e or synonym 
The method of claim 1, wherein some of components of the semantic vector have 
(main term - subordinate term pairs} as their first value. 

1 1 . (Currently amended) The method of claim 4 1, wherein the semantic vector is a multi- 
dimensional vector defined by the content of a semantic net. 
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12. (Previously Presented) The method of claim 11, wherein the content of the semantic net 
is augmented by relative weights, strengths, or frequencies of occurrence of the features 
within the semantic net. 

13. (Previously Presented) The method of claim 1, wherein the output of said defined 
algorithm is a measure of at least one of semantic distance, semantic similarity, semantic 
dissimilarity, degree of patentable novelty and degree of anticipation. 

14. (Currently Amended) A method of comparing two or more documents, comprising: 
linguistically analyzing two or more documents; 

generating a semantic vector associated with each document; and 
comparing the semantic vectors using a defined metri c, wherein said metric measures 
the semantic distance between two documents as a function of the relative 
frequencies of common terms and of common {main term-subordinate term pairs} 
between the two documents. 

15. (Canceled). 

16. (Currently Amended) The method of claim 4414, wherein a common term between two 
documents includes two terms that are synonyms. 

17. (Previously Presented) The method of claim 14, wherein one or more of said two or 
more documents are located using an autonomous software or 'bot program. 
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1 8. (Previously Presented) The method of claim 1 7, wherein the 'bot program: 
automatically analyzes each document in a defined domain or network by 
executing a series of rules and assigning an overall score to the document. 

19. (Previously Presented) The method of claim 18, wherein all documents with a score 
above a defined threshold are linguistically analyzed. 

20. (Previously Presented) . The method of claim 14, wherein the semantic vector is a 
quantification of the semantic content of each document. 

21 . (Previously Presented) The method of claim 14, wherein the semantic vector can have 
multiple components, and each component can have multiple dimensions. 

22. (Previously Presented) The method of claim 14, wherein each component of the 
semantic vector has a word or phrase appearing in the document or a synonym of said 
word or phrase; and 

at least one of a weighting factor associated with said word or phrase or synonym and a 
frequency value. 

23. (Currently Amended) A system for comparing two or more documents, comprising: 
a document inputter, arranged to access two or more documents; 

a semantic analyzer, arranged to perform a linguistic analysis on each document; 
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a semantic quantifier, arranged to output a quantified representation of a semantic content 
of each document; and 

a comparator, arranged to compare the quantified representations using a defined 
algorithm , wherein said defined algorithm measures the semantic distance between 
two documents as a function of the relative frequencies of common terms and of 
common {main term-subordinate term pairs! between the two documents. 

24. (Currently Amended) A system for comparing two or more documents, comprising: 
a document inputter, arranged to access two or more documents; 

a semantic analyzer, arranged to perform a linguistic analysis on each document; 

a semantic vector generator, arranged to output a semantic vector associated with each 

document; and 

a comparator, arranged to compare the semantic vectors using a defined metric , wherein 
said metric measures the semantic distance between two documents as a function of 
the relative frequencies of common terms and of common {main term-subordinate 
term pairs} between the two documents. 

25. (Canceled). 

26. (Currently Amended) A computer program product comprising a computer usable 
medium having computer readable program code means embodied therein, the computer 
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readable program code means in said computer program product comprising means for 

causing a computer to: 

access two or more documents; 

perform a linguistic analysis on each document; 

output a quantified representation of a semantic content of each document; and 
compare the quantified representations using a defined algorithm , wherein said defined 
metric measures the semantic distance between two documents as a function of the 
relative frequencies of one of common terms and common {main term-subordinate 
term pairs! between the two documents. 

27. (Currently Amended) A computer program product comprising a computer usable 

medium having computer readable program code means embodied therein, the computer 
readable program code means in said computer program product comprising means for 
causing a computer to: 

linguistically analyzing two or more documents; 

generating a semantic vector associated with each document; and 

comparing the semantic vectors using a defined metric , wherein said metric measures 

the semantic distance between two semantic vectors as a function of the relative 

frequencies of one of common terms and common (main term-subordinate term 

pairs! between the two documents. 
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28. (Previously Presented) The computer program product of claim 27, wherein the 
computer readable program code means in said computer program product further 
comprises means for causing a computer to: 

identify one or more of said two or more documents using an autonomous software or 
'bot program. 

29. (Previously Presented) The computer program product of claim 27, wherein said 'bot 
program automatically analyzes each document in a defined domain or network by 
executing a series of rules and assigning an overall score to the document. 

30. (Previously Presented) The computer program product of claim 27, wherein the semantic 
vector is a quantification of the semantic content of each document. 

3 1 . (Previously Presented) The computer program product of claim 27, wherein the output of 
said defined metric is a measure of at least one of semantic distance, semantic similarity, 
semantic dissimilarity, degree of patentable novelty and degree of anticipation. 

32. (Canceled). 

33. (New) A system for comparing two or more documents, comprising: 
a document inputter, arranged to access two or more documents; 

a semantic analyzer, arranged to perform a linguistic analysis on each document; 
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a semantic vector generator, arranged to output a semantic vector associated with each 
document; and 

a comparator, arranged to compare the semantic vectors using a defined metric, wherein 
said defined metric is one of: 

[Sqrt(f 1 2 + f2 2 + f3 2 + f4 2 + + f(N-1 f fN 2 )/n] * 1 00, 
wherein f is a difference in frequency of a common term between two documents and n is 
the number of terms those documents have in common; or 

Sqrt(sum((w-Delta) A 2 * w-Avg)) / (Log(n) A 3 * 1000), 
wherein w-Delta is the difference in weight between two common terms, w-Avg is the 
average weight between two common terms, and n is the number of common terms, 
between two documents. 

34. (New) A method of comparing two or more documents, comprising: 
linguistically analyzing two or more documents; 
generating a semantic vector associated with each document; and 
comparing the semantic vectors using a defined metric, wherein said defined metric is 
one of: 

Sqrt(f1 2 + f2 2 + f3 2 + f4 2 + + f(N-1 f fN 2 )/n] * 100 

wherein f is a difference in frequency of a common term between two documents and n is 
the number of terms those documents have in common; or 
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Sqrt(sum((w-Delta) A 2 * w-Avg)) / (Log(n) A 3 * 1000), 



wherein w-Delta is the difference in weight between two common terms, w-Avg is the 
average weight between two common terms, and n is the number of common terms, 
between two documents. 

35. (New) A computer program product comprising a computer usable medium having 
computer readable program code means embodied therein, the computer readable 
program code means in said computer program product comprising means for causing a 
computer to: 

access two or more documents; 

perform a linguistic analysis on each document; 

output a quantified representation of a semantic content of each document; and 
compare the quantified representations using a defined algorithm, wherein said defined 
algorithm is one of: 

[Sqrt(f 1 2 + f2 2 + f3 2 + f4 2 + + f(N-1 f fN 2 )/n] * 1 00, 

wherein f is a difference in frequency of a common term between two documents and n 
the number of terms those documents have in common; or 
Sqrt(sum((w-Delta) A 2 * w-Avg)) / (Log(n) A 3 * 1000), 
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wherein w-Delta is the difference in weight between two common terms, w-Avg is the 
average weight between two common terms, and n is the number of common terms, 
between two documents. 
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